Safety Gym
Appendices
Note that this safe RL problem is less general than the standard formulation of safe RL. The authors introduce a teacher-student hierarchy. To learn the teacher's policy, the following constraint is imposed: the unsafe set is contained in the intervention set. The teacher learns when to intervene and how to switch between different interventions.
A1.2 RL with probability-one constraints. We introduce a safety state to the environment. First, we discuss our design for the PI controller and the components it requires. The proportional part delivers brute-force control by applying a large control magnitude for large errors, but it becomes ineffective once the instantaneous error values are small.
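The proportional/integral split described above can be made concrete with a minimal sketch of a discrete-time PI controller. This is an illustrative implementation, not the authors' code; the gains `kp`, `ki` and the timestep `dt` are hypothetical names chosen for clarity.

```python
class PIController:
    """Discrete-time PI controller: u = kp * e + ki * integral(e)."""

    def __init__(self, kp: float, ki: float, dt: float):
        self.kp = kp          # proportional gain: large action for large errors
        self.ki = ki          # integral gain: accumulates small residual errors
        self.dt = dt          # timestep used to accumulate the integral
        self.integral = 0.0

    def control(self, error: float) -> float:
        # The proportional term reacts only to the instantaneous error, so it
        # fades as the error shrinks; the integral term keeps pushing until
        # the accumulated error is driven out, removing steady-state offset.
        self.integral += error * self.dt
        return self.kp * error + self.ki * self.integral
```

With `ki = 0` this reduces to pure proportional control, which stalls near the setpoint exactly as the passage describes; the integral term is what closes that residual gap.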
A Hyperparameters and finer experimental details
The hyperparameters used for our algorithm are shown in Table 1. The 'Point' robot has steering and throttle as its action space, while the 'Car' robot uses differential control. We use a Performance Ratio (PR) threshold of 66%. A minimum of 4 GB of GPU memory is required to run both model-based approaches. We also compare how the model-learning validation loss varies in the safe RL setting as opposed to the unconstrained RL one.
Handling Cost and Constraints with Off-Policy Deep Reinforcement Learning
Markowitz, Jared, Silverberg, Jesse, Collins, Gary
By reusing data throughout training, off-policy deep reinforcement learning algorithms offer improved sample efficiency relative to on-policy approaches. For continuous action spaces, the most popular methods for off-policy learning include policy improvement steps where a learned state-action ($Q$) value function is maximized over selected batches of data. These updates are often paired with regularization to combat associated overestimation of $Q$ values. With an eye toward safety, we revisit this strategy in environments with "mixed-sign" reward functions; that is, with reward functions that include independent positive (incentive) and negative (cost) terms. This setting is common in real-world applications, and may be addressed with or without constraints on the cost terms. We find the combination of function approximation and a term that maximizes $Q$ in the policy update to be problematic in such environments, because systematic errors in value estimation impact the contributions from the competing terms asymmetrically. This results in overemphasis of either incentives or costs and may severely limit learning. We explore two remedies to this issue. First, consistent with prior work, we find that periodic resetting of $Q$ and policy networks can be used to reduce value estimation error and improve learning in this setting. Second, we formulate novel off-policy actor-critic methods for both unconstrained and constrained learning that do not explicitly maximize $Q$ in the policy update. We find that this second approach, when applied to continuous action spaces with mixed-sign rewards, consistently and significantly outperforms state-of-the-art methods augmented by resetting. We further find that our approach produces agents that are both competitive with popular methods overall and more reliably competent on frequently-studied control problems that do not have mixed-sign rewards.
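The "mixed-sign" reward structure the abstract describes can be sketched in a few lines. This is an assumed illustration, not the paper's code: the names `progress`, `constraint_violation`, and `cost_weight` are hypothetical, and the point is only that the return sums an independent positive (incentive) term and a negative (cost) term, so a systematic overestimate of one term's value skews the balance between them.

```python
def mixed_sign_reward(progress: float, constraint_violation: float,
                      cost_weight: float = 1.0) -> float:
    """Reward with independent incentive and cost terms (mixed signs)."""
    incentive = progress                        # positive term to maximize
    cost = cost_weight * constraint_violation   # non-negative penalty term
    return incentive - cost
```

Because a learned Q function approximates the discounted sum of such rewards, an optimistic bias inflates the incentive and cost contributions asymmetrically, which is the failure mode the paper attributes to Q-maximizing policy updates.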
- South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Europe > Sweden > Stockholm > Stockholm (0.04)
- Asia > Middle East > Jordan (0.04)
Safe Reinforcement Learning in a Simulated Robotic Arm
Reinforcement learning (RL) agents need to explore their environments in order to learn optimal policies. In many environments and tasks, safety is of critical importance. The widespread use of simulators offers a number of advantages, including safe exploration, which becomes unavoidable when RL systems must be trained directly in the physical environment (e.g., in human-robot interaction). The popular Safety Gym library offers three mobile agent types that can learn goal-directed tasks while respecting various safety constraints. In this paper, we extend the applicability of safe RL algorithms by creating a customized environment with a Panda robotic arm in which Safety Gym algorithms can be tested. We performed pilot experiments with the popular PPO algorithm, comparing the baseline with its constrained version, and show that the constrained version learns an equally good policy while complying better with safety constraints, at the expected cost of longer training time.
- North America > United States > New York (0.05)
- Europe > Slovenia > Central Slovenia > Municipality of Ljubljana > Ljubljana (0.05)
- Europe > Slovakia > Bratislava > Bratislava (0.05)
Towards Safe Reinforcement Learning with a Safety Editor Policy
Yu, Haonan, Xu, Wei, Zhang, Haichao
We consider the safe reinforcement learning (RL) problem of maximizing utility while satisfying provided constraints. Since we do not assume any prior knowledge or pre-training of the safety concept, we are interested in asymptotic constraint satisfaction. A popular approach in this line of research is to combine the Lagrangian method with a model-free RL algorithm to adjust the weight of the constraint reward dynamically. It relies on a single policy to handle the conflict between utility and constraint rewards, which is often challenging. Inspired by the safety layer design (Dalal et al., 2018), we propose to separately learn a safety editor policy that transforms potentially unsafe actions output by a utility maximizer policy into safe ones. The safety editor is trained to maximize the constraint reward while minimizing a hinge loss of the utility Q values of actions before and after the edit. On 12 custom Safety Gym (Ray et al., 2019) tasks and 2 safe racing tasks with very harsh constraint thresholds, our approach demonstrates outstanding utility performance while complying with the constraints. Ablation studies reveal that our two-policy design is critical. Simply doubling the model capacity of typical single-policy approaches will not lead to comparable results. The Q hinge loss is also important in certain circumstances, and replacing it with the usual L2 distance could fail badly.
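The hinge loss on utility Q values described in the abstract can be illustrated with a minimal sketch. This is an assumed formulation, not the authors' released code: the editor is penalized only when the edited action's utility Q value falls below that of the original action, and `margin` is an illustrative slack parameter.

```python
def q_hinge_loss(q_before: float, q_after: float, margin: float = 0.0) -> float:
    """Hinge loss on utility Q values before and after the safety edit.

    Zero while the edit does not reduce utility; grows linearly with the
    drop in Q once the edited action is worse than the original one.
    """
    return max(0.0, q_before - q_after + margin)
```

Unlike an L2 distance between the two Q values, this loss is one-sided: the editor is free to improve utility and is only punished for degrading it, which matches the abstract's finding that replacing the hinge with L2 could fail badly.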
- Asia > Middle East > Jordan (0.04)
- North America > United States > California > Santa Clara County > Cupertino (0.04)
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.93)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
The limitations of AI safety tools
In 2019, OpenAI released Safety Gym, a suite of tools for developing AI models that respect certain "safety constraints." At the time, OpenAI claimed that Safety Gym could be used to compare the safety of algorithms and the extent to which those algorithms avoid making harmful mistakes while learning. Since then, Safety Gym has been used to measure the performance of proposed algorithms from OpenAI as well as from researchers at the University of California, Berkeley and the University of Toronto. But some experts question whether AI "safety tools" are as effective as their creators purport them to be -- or whether they make AI systems safer in any sense. "OpenAI's Safety Gym doesn't feel like 'ethics washing' so much as maybe wishful thinking," Mike Cook, an AI researcher at Queen Mary University of London, told VentureBeat via email.
- North America > Canada > Ontario > Toronto (0.55)
- North America > United States > California > Alameda County > Berkeley (0.25)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.89)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.89)
8 Best Alternatives To OpenAI Safety Gym
Two years ago, OpenAI released Safety Gym, a suite of environments and tools for measuring progress towards reinforcement learning agents that respect safety constraints while training. Safety Gym has use cases across the reinforcement learning ecosystem. The open-source release is available on GitHub, where researchers and developers can get started with just a few lines of code. In this article, we will explore some of the alternative environments, tools and libraries that researchers can use to train machine learning models. AI Safety Gridworlds is a suite of reinforcement learning environments illustrating various safety properties of intelligent agents.
- Education (0.75)
- Banking & Finance > Trading (0.32)
OpenAI Open Sourced this Framework to Improve Safety in Reinforcement Learning Programs
I recently started a new newsletter focused on AI education. TheSequence is a no-BS (meaning no hype, no news, etc.) AI-focused newsletter that takes 5 minutes to read. The goal is to keep you up to date with machine learning projects, research papers and concepts. Safety is one of the emerging concerns in deep learning systems. In the context of deep learning systems, safety is related to building agents that respect safety dynamics in a given environment.